Main Page
Welcome to Leeroopedia
Your ML & Data Knowledge Wiki. Best practices and expert-level knowledge for Machine Learning and Data Engineering, covering 1000+ frameworks and libraries from training to deployment.
Browse implementation patterns, configuration guides, debugging heuristics, and battle-tested defaults for frameworks like vLLM, DeepSpeed, Megatron-LM, FlashAttention, Triton, Unsloth, LangChain, and many more. Every page is structured so both humans and AI agents can find what they need fast.
Connect your AI coding agent. Plug Leeroopedia into your favorite coding agent with the Leeroopedia MCP setup guide. Let it search docs, build plans, verify code, and diagnose failures on your behalf.
Go end-to-end. Leeroopedia gives your agent the knowledge. Kapso gives it the ability to act on it: research, experiment, and deploy.
Browse by Category
| Category | Description | Browse |
|---|---|---|
| Workflows | Step-by-step processes and procedures | Browse All |
| Principles | Core ideas and foundational knowledge | Browse All |
| Implementations | Code-level details and modules | Browse All |
| Heuristics | Best practices and guidelines | Browse All |
| Environments | Setup and configuration guides | Browse All |
Explore Pages
Workflows
- Workflow:Google deepmind Mujoco Model compilation and conversion
- Workflow:Datahub project Datahub Metadata Actions Pipeline
- Workflow:Apache Airflow Kubernetes Deployment via Helm
- Workflow:ARISE Initiative Robosuite Environment Setup And Simulation
- Workflow:Sgl project Sglang Offline Batch Inference
- Workflow:Rapidsai Cuml Sklearn Zero Code Acceleration
- Workflow:FlowiseAI Flowise Agentflow V2 Creation
- Workflow:Recommenders team Recommenders Neural Collaborative Filtering
- Workflow:Wandb Weave SDK Release
- Workflow:ChenghaoMou Text dedup SimHash Deduplication
Principles
- Principle:Tensorflow Tfjs Converter Installation
- Principle:Eventual Inc Daft Pandas Export
- Principle:DevExpress Testcafe CDP Browser Control
- Principle:Arize ai Phoenix Application Instrumentation
- Principle:Rapidsai Cuml Support Vector Machines
- Principle:Heibaiying BigData Notes HBase Data Deletion
- Principle:Iterative Dvc Plot Output Rendering
- Principle:Roboflow Rf detr Model Computational Profiling
- Principle:Sail sg LongSpec Distributed Training
- Principle:Mit han lab Llm awq NVILA Model Construction
Implementations
- Implementation:FMInference FlexLLMGen DeepSpeed Runtime Config
- Implementation:Treeverse LakeFS Java SDK Model IcebergLocalTable
- Implementation:Snorkel team Snorkel SlicingFunction Init
- Implementation:Risingwavelabs Risingwave Handle Create Source
- Implementation:ArroyoSystems Arroyo Debezium Extension
- Implementation:Explodinggradients Ragas RagasTracer Callbacks
- Implementation:NVIDIA TransformerEngine TE Distributed Checkpoint
- Implementation:Tensorflow Serving Multi Inference Test
- Implementation:Microsoft Semantic kernel InMemoryVectorStore
- Implementation:Langgenius Dify ModifyDocMetadata
Heuristics
- Heuristic:Roboflow Rf detr EMA Best Checkpoint Strategy
- Heuristic:Scikit learn Scikit learn Data Leakage Prevention
- Heuristic:Apache Airflow Variable Access Pattern
- Heuristic:Deepset ai Haystack Document Splitting Defaults
- Heuristic:Lm sys FastChat Tokenizer Offset Correction
- Heuristic:Nautechsystems Nautilus trader Parquet Row Group Tuning
- Heuristic:OpenGVLab InternVL Multi GPU ViT Device Mapping
- Heuristic:Microsoft Playwright Timeout Configuration Tips
- Heuristic:Scikit learn contrib Imbalanced learn KNeighbors Selection Tips
- Heuristic:Cypress io Cypress Xvfb Display Gotcha
Environments
- Environment:Onnx Onnx Cpp Build Environment
- Environment:Haosulab ManiSkill Real Robot LeRobot Deps
- Environment:Promptfoo Promptfoo Browser Automation
- Environment:Apache Paimon Cloud Storage Credentials
- Environment:Alibaba MNN ARM Mobile Environment
- Environment:Ray project Ray Python Runtime Environment
- Environment:Sgl project Sglang Triton
- Environment:Deepset ai Haystack HuggingFace Model Environment
- Environment:Openai Evals Optional Provider APIs
- Environment:Apache Dolphinscheduler Java Runtime